Search CORE

22 research outputs found

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

Author: Bougouin Adrien
Frank Eibe
Glorot Xavier
Ioffe Sergey
Kim Su Nam
Kim Youngsam
Kingma Diederik
Mihalcea Rada
Mikolov Tomáš
Qazvinian Vahed
Wan Xiaojun
Wang Yining
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 19/05/2019
Field of study

Keyphrase extraction from documents is useful to a variety of applications such as information retrieval and document summarization. This paper presents an end-to-end method called DivGraphPointer for extracting a set of diversified keyphrases from a document. DivGraphPointer combines the advantages of traditional graph-based ranking methods and recent neural network-based approaches. Specifically, given a document, a word graph is constructed from the document based on word proximity and is encoded with graph convolutional networks, which effectively capture document-level word salience by modeling long-range dependency between words in the document and aggregating multiple appearances of identical words into one node. Furthermore, we propose a diversified point network to generate a set of diverse keyphrases out of the word graph in the decoding process. Experimental results on five benchmark data sets show that our proposed method significantly outperforms the existing state-of-the-art approaches.Comment: Accepted to SIGIR 201

arXiv.org e-Print Archive

Crossref

Deep specification mining

Author: Arp Daniel
Hindle Abram
James
Knight John C.
Mikolov Tomas
Mikolov Tomáš
Sutskever Ilya
Tien-Duy
White Martin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/07/2018
Field of study

Singapore National Research Foundatio

Crossref

Institutional Knowledge at Singapore Management University

STATISTICAL LANGUAGE MODELS BASED ON NEURAL NETWORKS

Author: Mikolov Tomáš
Publication venue: Vysoké učení technické v Brně. Fakulta informačních technologií
Publication date: 01/01/2012
Field of study

Digital library of Brno University of Technology

National Repository of Grey Literature

STATISTICAL LANGUAGE MODELS BASED ON NEURAL NETWORKS

Author: Mikolov Tomáš
Publication venue: Vysoké učení technické v Brně. Fakulta informačních technologií
Publication date
Field of study

Statistické jazykové modely jsou důležitou součástí mnoha úspěšných aplikací, mezi něž patří například automatické rozpoznávání řeči a strojový překlad (příkladem je známá aplikace Google Translate). Tradiční techniky pro odhad těchto modelů jsou založeny na tzv. N-gramech. Navzdory známým nedostatkům těchto technik a obrovskému úsilí výzkumných skupin napříč mnoha oblastmi (rozpoznávání řeči, automatický překlad, neuroscience, umělá inteligence, zpracování přirozeného jazyka, komprese dat, psychologie atd.), N-gramy v podstatě zůstaly nejúspěšnější technikou. Cílem této práce je prezentace několika architektur jazykových modelůzaložených na neuronových sítích. Ačkoliv jsou tyto modely výpočetně náročnější než N-gramové modely, s technikami vyvinutými v této práci je možné jejich efektivní použití v reálných aplikacích. Dosažené snížení počtu chyb při rozpoznávání řeči oproti nejlepším N-gramovým modelům dosahuje 20%. Model založený na rekurentní neurovové síti dosahuje nejlepších publikovaných výsledků na velmi známé datové sadě (Penn Treebank).Statistical language models are crucial part of many successful applications, such as automatic speech recognition and statistical machine translation (for example well-known Google Translate). Traditional techniques for estimating these models are based on Ngram counts. Despite known weaknesses of N-grams and huge efforts of research communities across many fields (speech recognition, machine translation, neuroscience, artificial intelligence, natural language processing, data compression, psychology etc.), N-grams remained basically the state-of-the-art. The goal of this thesis is to present various architectures of language models that are based on artificial neural networks. Although these models are computationally more expensive than N-gram models, with the presented techniques it is possible to apply them to state-of-the-art systems efficiently. Achieved reductions of word error rate of speech recognition systems are up to 20%, against stateof-the-art N-gram model. The presented recurrent neural network based model achieves the best published performance on well-known Penn Treebank setup.

Digital library of Brno University of Technology

Language Modeling for Spech Recognition in Czech

Author: Mikolov Tomáš
Publication venue: Vysoké učení technické v Brně. Fakulta informačních technologií
Publication date
Field of study

This work concerns the problematic of language modeling in automatic speech recognition. Currently widely used techniques for advanced language modeling based on statistical approach are described in the first part of work - class based language models, factored language models and neural network based language models. In the next section, implementation of neural network based language model is described. Results obtained on "Pražský mluvený korpus" and "Brněnský mluvený korpus" corpora (1 170 000 words) are reported, with perplexity reduction around 20%. Also, results obtained after rescoring N-best lists with spontaneous speech are reported, with absolute improvement in accuracy by more than 1%. In the conclusion, possible uses of the work are mentioned, along with possible extensions in the future. Finally, main weaknesses of current statistical language modeling techniques are described

National Repository of Grey Literature

Extensions of recurrent neural network language model.

Author: Honza &quot
Jan
Lukáš Burget
Sanjeev Khudanpur
Stefan Kombrink
Tomáš Mikolov
Publication venue
Publication date: 01/01/2011
Field of study

ABSTRACT We present several modifications of the original recurrent neural network language model (RNN LM). While this model has been shown to significantly outperform many competitive language modeling techniques in terms of accuracy, the remaining problem is the computational complexity. In this work, we show approaches that lead to more than 15 times speedup for both training and testing phases. Next, we show importance of using a backpropagation through time algorithm. An empirical comparison with feedforward networks is also provided. In the end, we discuss possibilities how to reduce the amount of parameters in the model. The resulting RNN model can thus be smaller, faster both during training and testing, and more accurate than the basic one

CiteSeerX

A Data-Driven Misbehavior Detection System for Connected Autonomous Vehicles

Author: Alvarez Alberto Diaz
Bai Fan
Box Joan Fisher
Krajzewicz Daniel
Krauß Stefan
Lefèvre Stéphanie
Mikolov Tomáš
Xu Li
Zaidi Kamran
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Meta-path Augmented Sequential Recommendation with Contextual Co-attention Network

Author: Cheng Chen
Du Yong
Feng Shanshan
Gao Lianli
Gehring Jonas
Hu Binbin
Le Quoc
Lin Zhouhan
Liu Duen-Ren
Liu Jian
Liu Qiang
Mikolov Tomas
Mikolov Tomáš
Rendle Steffen
Shen Tao
Shi Chuan
Vaswani Ashish
Xu Kelvin
Zhang Yongfeng
Zhang Yuyu
Zheng Lei
Zhou Chang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref